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INTRODUCTION 



For any system to survive, the users must be satisfied that it meets 
l Ivc i r needs promptly and consistently. 

Especially in the initial pci iod of testing f. user's profile, it is 
essential to react quickly to any undos i red result such as an extremely 
high or low output, too low a relevance or recall, or both . Otherwise the 
user will he discouraged and t lie service discredited. If we are not to 
oversell the system, we must maintain a good standard quality of service, 
or t I k? disappointed user will not pay for it. 

It is clear that the search editor should feel himself responsible 
not only for the profile setup but also for its continual adjustment. 

If lie is to be able to take the proper steps, lie must knw both what 
to do and low to do it. His knowledge must cover three areas: 

1. The user’s exact information need. 

2 . The user's relevance and recall, and which, if either, he 
prefers to improve. 

5. The means to achieve the desired objectives. 

Let us follow these points, making use of the information anti 
"knowhow" compiled in our previous work (1). 
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1 1 PURPOSE 



Our OJMPliNPEX Profi ling Guide provided guidelines for formulating 
profiles. Now, with the growing nunber of users, the need Ivis arisen for 
a CCMPUXDIiX Profile Adjustment Manual. 

Iliis aid is designed mainly for search editors arxl for a limited 
mrnber of users who wish to adjust their own profiles. 

Note : COMPliXDEX in this manual is the designation of the computerized 
version of t lie ling i nee ring Index Monthly, supplied by the Engineering index 
Inc., 345 East 47th Street, New York, N'. Y. 

TliXM’AL*, as referred to in this manual, is IBM's systan whose main 
author is l'r. Samuel Kaufman of IHM. 

We use this systan to process the Engineering Index tapes for both 
current awareness and Retrospective Search. 
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Uc must always ensure tlut the user's profile is congruent with his 
actual information need. While this might scan to be a self-evident 
postulate, it is lx? t so easily achieved. 

ihe various situations that can occur may be illustrated as follows: 

* the Information Need 

s the C^iest ion 

1. The question sixxild cover perfectly tlx? information need. Hie 
user gets the pertinent information. 

<SX 

2. The question is formulated more broadly than the actual information 
need; i.c., Q is larger than N. The user will thus get irrelevant 
information for the area of Q in excess of N\ Relevance is low. 




5. The question constructed does not cover all of the information 
need; i.e., Q is smaller than X. Tlx; user will be missing tl»o 




information for the area of N in excess of Q, or, recall is 
too low. 

4. In the practical informal' ion-retrieval envirorment, we often 
meet a situation like this: 

N) 

Ik n, tlx' information need anil the question partially overlap. 



'Ifi is implies that the user will obtain pertinent hits only for 
the area of overlap. Ihe remaining area of Q represents the 
amount of irrelevant information retrieved; the remaining area 
of N, the relevant information not retrieved. 

S. The extreme unfavourable condition is shown in the configuration 
below, where Q and N fail to completely coincide in any particular. 



It is obvious that (1) represents the most desirable situation. There 
is, however, another condition which must be met: both N and Q must fall 

within the area of so-called "a priori relevance ’or’ mega- relevance." 



If the data base is not capable of answering a question satisfactorily, we 
say it has a low a priori relevance. 

lor example, llblPKNUIX data base would not be pertinent to any question 
of a historical or philosophical nature. In many cases it will be partly 
suitable. 

A priori relevance is the first thing to consider l>eforc going any 
further in t he profile setup. The coverage of the data base (subject areas 
covered, d tv uncut s abstracted) will provide an answer here. 

Once a profile has been set up, both user and search editor are 
responsible for its continued adjustment in the light of previous runs. All 
users should report immediately any change in their information need. Khcn 
a user marks hits as irrelevant, although the information was correctly 
selected as a match for his profile, it may be that he has changed his 
subject area of interest and failed to lot us know. In such case the user 
should be asked innediately to cither revise or revrite his profile. 
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In CCMPI.NIO service, by using T1.XT-1WC we arc able to increase either 
relevance at the expense of recall or recall to the detriment of relevance, 
or to operate with reasonable degrees of both. Qir ("CMI'LMiliX/lirXT-I’AC-CIS 
Project Report (1) will provide any desired details concerning relevance 
and recall evaluation, system performance, etc. 

Relevance is a very useful indicator of tin? system performance. There 
is, however, a big difference between a relevance of G0£ tliat represents 
b out of 10 abstracts, and a relevance of 60» that represents COO out of 
1000. hhcrc 4 irrelevant abstracts are negligible, TOO become a massive 
inconvenience. I he specific amount of "trash" (irrelevant retrieved) must 
be indicated, then, to make the relevance figure really meaningful. 

Recall, t lough a natural counterpart of relevance, may be rather 
difficult to determine. If ascertained, it is very appropriate for it to 
be accarjvini ed by the lumber of ’blisses" (relevant twit retrieved). 

It is an inherent property of relevance and recall that, once a 
certain standard of performance has been readied, we can improve either 
only at the expense of the otlier. 

he have seen tliat to truly characterize the {vrfomance of a profile 
we must relate relevance with "trash" and recall with ’blisses." However, 
in older to hive a complete picture of a profile, we need to know which the 
user prefers: high relevance, high recall, or mediiri degrees of loth, he 
can then make a responsible decision about what to do with the profile, and 
find the most effective steps to attain this goal. Ke rcccmend asking the 
users to supply this information at their earliest convenience so that we 
can monitor the output of their profiles in the desired way. 

If the status of a profile run looks like this (three months* average) 
| ITofilc *800110 (Relevance oriented) Rol .V),T70; Rec 70, 'VS; 



we have to mnrovc relevance. 

On the other lund, 

Profile SOOltJo (Recall oriented) Rcl 85,110; Roe io'^Vu;' | 
calls for a radical improvement in recall. 

It is a pi it that a medinn-or iented user would he content with this 

situat ion: 



1'rofilc 800 i i i pMedi m oriented) Hel 59,m; Roe S2,M30. 



(Xir ,^ion should not be based on a random result of a month, but 
on an at - i . of several months, he will find, however, that if a profile 
remains *nged and the coverage in the data base is fairly steady, the 
output pi oKc/recall ratio will not vary significantly. 

To 1- p track of the profile performance we suggest tint the following 
fomat ! . 1 ered to (Pig. 1): 



1970 



NAMI; 


nuriLF mi. 


pRiiauixck 


JAN 


FEB 


MAR 


APR 


MAY ... 


NOT I. 


S' 1 1 1 H 
J. F. 


800108 


RIP 


75 


15 


















40 


60 
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lig. 1 * Profile Performance Pom 



Ihis record rreans tint our user, 5*11111 J. F., profile raerber 800108, 
indicated relevance as the preferred parameter in viewing his output. In 



January 1970 his profile had attained relevance of 75% and there were 15 
irrelevant retrieved abstracts (trash). Recall was found to be approximately 
40%, which means that some 60 relevant abstracts were not retrieved. Hach 
month's record has the following format: 



Rel 


Trash 


Rec | 


Miss 



The degree of relevance will be established easily. In the profile 
800108 above, we have received an output of 60 cards in the January run. 

We sent this output to the user and we got his feedback. He marked 45 
response cards "relevant" by pushing out the appropriate box. The remaining 
15 response cards represent the irrelevant portion of the output. As 45 
relevant abstracts make 75% of the total output of 60, we say the relevance 
equals 75% and we note this figure in the upper left-liand corner of his 
profile in the January column. As 15 abstracts retrieved were irrelevant 
(trash), we write this figure in the upper right-hand corner. 

Recall is more difficult to determine. To find out the relevant 
information not retrieved, we have to go through the entire data base, 
unless we use sane of the less clunsy methods which, of course, are also 
less precise (see Project Report CXMP£NDIiX/THXT-PAC-CIS (1)}. h'e can save 
a lot of time if we compute recall only for the recall -oriented users. 
Additional effort can be saved if we detennine recall figures for the 
extreme values of relevance (both high and low), where we may anticipate 
the extreme opposite values for recall (highest for low relevance and 
lowest for high relevance); approximate recall values for other profiles 
can then be derived from the graph drawn between the extremities. 

If, for whatever reason, we are unable to establish recall values even 
in the time-saving ways described, we can still impiove recall for the 
local 1 -oriented user: we simply apply such of the measures listed in 

Section 2 of t’luptor V' as are most appropriate, if even then we do not 
succeed in obtaining a better recall, the data base may he regarded as 



unsuitable for the information request. This, of course, should have been 
ascertained before submitting the profile. 

If we proceed in the time-saving way described, above (i.e. we use the 
method of recall evaluation set forth in (1) , and take only the recall- 
oriented profiles, and consider only extreme values of relevance plus sane 
medium) we can plot a graph giving the approximate performance of the system. 
By means of this graph we can roughly estimate other recall values from the 
assessed relevance values, and vice versa. Supposing we have established 
the performance as depicted in the Fig. 2, we could roughly derive, that 
the recall values l'anging from 17% to 341 might be anticipated for the 
relevance of 90%. With the recall value of 34% we would be allowed to 
operate in the range of 32-90% relevance. 




Fig. 2 - System Performance Graph 



It remains to clarify the relationship between, on the one hand, high 
and low output for a profile, and on the other hand, relevance and recall. 

High output may be indicative of a higher recall. Usually, some general 
terms and a very simple s tincture of the search expressions are responsible 



for th i ^ type of result. Unless these terms were deliberately chosen, low 
relevance may be expected in this case. 

A low output may be caused either by a low a priori relevance of the 
data base (unsuitable data base for this profile) or by faulty question 
formulation. In the first case both relevance and recall may be adequate; 
in the second case, no definitive statement can be made before evaluating 
relevance, but low recall is likely. 



V MEANS TO BRING ABOUT THE DESIRED OBJECTIVES 



We should now have reached the point where we know the user's exact 
information need, the performance of his pi'ofile and his preference for 
relevance or recall. 

Let us now examine the ways in which we can optimize system performance 
and thereby improve user satisfaction. 

(1) Ifow to Increase Relevance 

There may be some situations contributing to a low relevance which we 
cannot influence. For example, we may find an abstract which appears to be 
pertinent, but the hard copy will prove to be irrelevant. The abstractor 
is responsible for this type of error. He is also responsible if the terms 
used in a data-base record are too general . 

Here are the things the search editor can do to increase relevance: 

1. Increase the exhaustivity of the query. ("Exhaust ivity" means 
full, complete, comprehensive definition of an information need.) 

Suppose that a user needs information regarding machinery for 
cutting plastic foams, especially dimensions, prices, production rates, 

I tower consumption, working width, automation features. If he does not 
specify all facets (categories) such as material (plastic, synthetic, 



polyurethane), machinery (cutting machine) , product (foam), additional 
features (automation), he may get rather poor relevance. 

The search editor should ask the user to add any missing categories 
and words, to ensure what is called a "multiple approach." 

It should be noted, however, that enhancing the exhaustiveness of 
a profile to promote relevance must be done inside one or more search ex- 
pressions and not by adding more search expressions. The reason is that 
search expression (CON) is the entity being searched as a unit. 

Remember, expressing the multiple approach by splitting the addi- 
tional terms into self-sufficient search expressions will not provide a 
smaller amount of more relevant information, but a higher output with an 
improved recall. 

In formulating a modified profile, a tw r o- or three-level structure 
should be applied to make a profile more exhaustive (for three-level logic 
structure see (1) ) . 



2. Increase the specificity of the query. "Specificity" in this sense 
means a precise definition of a subject using the proper hierarchical level (s). 
The following example will clarify what is meant. Suppose we want information 
regarding "thermoplastic resins." The hierarchical structure is (Fig, 3): 



I 

II 

III 

IV 



[chemistry* 




macrcmolecular chemistry 1 



/ I res ins 1 ^ \ ^ 



resins 




Tig. 3 - Hierarchical Structure 



It is evident that specifying the information need as "resins" 
brings too much noise into the output, whereas stating it as "polyvinyl- 
alcohol" would prevent other thermoplastic resins being retrieved for the 
user. 



When in doubt, state all hierarchical levels (resins, thermoplastic 
resins, polyvinyl alcohol, etc.) if it is important not to reduce recall. 

3. 'Jse "WITH" or "ADJ" where "AND" has produced a low relevance. 

Do not use "WITH" and "ADJ" indiscriminately. Use "AUJ" only if 
there is a fair chance of both words occurring close together in the order 
indicated; otherwise, use "WITH." 

Be sure to take this step only if the user clearly favours higher 
relevance and does not mind the lower recall that will result. 

4. Check whether some irrelevant information could not be traced back 
to the lack of some "capitalization" feature. 

Perhaps incorporating one or two "at" signs in the profile would 
forestall the selection of sane irrelevant documents as hits. If informa- 
tion concerning any acronym is wanted, be sure to use the appropriate 
capitalization, if the word is a contnon one in English. 

5. "CONTROL" logic can sometimes be helpful in improving relevance. 

An example of this might be limiting the search to the "author" print 
control, where "brown" or "smith" would cause irrelevant information to be 
retrieved in the abstract. Capitalization could do an equivalent job in 
this case. 

6. Consider using "NOT-CONTROL , " which may, in a limited number of 
search expressions, promote relevance. 

If, for example, the user is interested in "mining," the search 
slxxild not be conducted in the "citation" print control, as it would find 



any article in the journal "Coal and Mining," and others. 

7. Check any truncation in the profile. If truncation is done in a 
way that leaves the door open for irrelevant items, correct truncation. 

Remember, truncation must be tailored to measure. If too broad, 
it decreases relevance (e.g. "bio" adds "biochemistry," "biography" to the 
wanted "biology"}; if too narrow it decreases recall. 

8. Be careful in using a higher match criterion since this may 
drastically reduce output. The relevance would be improved only if the 
search expressions, which must be jointly matched, are related. Sometimes 
profiles are designed so that this indeed will be the case. Otherwise, 
increasing the match criterion is rather pointless and should be avoided, 
as it impairs recall. Three levels of vertical structure should suffice 
to identify the search expression; in rare cases where this is not so, a 
higher match criterion would be warranted. 

9. If an incorrect word has produced irrelevant information, consult 
the user or another expert in his subject area. This actually should not 
happen, because correct terms (along with synonyms, antonyms, acronyms and 
related terns) are the minimum we require from the users. Technical 
encyclopaedias, thesauri, etc., might also be useful (see COMPENDIA 
lhofiling Guide). Check also for possible errors in coding, typing and 
punching. 



10. Make sure the tenns state explicitly the concept desired. 
Relevance failures are sometimes caused by ambiguous tenns. 

Tor example the term "reduction," or any truncated form of it, 
may supply information dealing with "reduction" as a common type of 
chemical reaction, hut it will also include papers about cost reduction, 
price reduction, reduction valve, rcduct ion gear, reduction piece, reduced 
scale, reducing transformer, etc. 



So, if the answers are to be directed to cost reduction, specify 



"COSTS WITH REDUC$$$$ . " 



11. Hike sure there is no false coordination. 

For example, suppose we want information about information 
retrieval and about infonnation specialists using computers. If we 
formulate our question like this: 

A1 RETRIEVAL OR SPECIALISTS 
A 2 INFORMATION OR COMPUTERS* 

COM A1 WITH A 2 



we will be getting not only the desired infonnation, but also the undesired 
"computer specialist," owing to the false coordination. 

Such a construction must be restructured, e.g.: 

A1 INFORM VTI ON WITH RETRIEVAL OR SPECIALISTS 
A 2 COMPUTERS* 

CONI A1 AND A2 

12. If a certain formulation of any search expression produces 
repeated relevance failures, and no part of it can be left out, use the 
logic "NOT," e.g.: 



CONS NETWORKS 

C0N9 NOT 11130111$$$$$ 

(2) An Example of Improving Relevance 

The information need has been stated as follows: "I request an Sl)l 

•service in the field of wire production, especially casting. I am inter- 
ested also in the economical aspects of production. Production costs of 
wire are of interest as well (in any type of wire production)." 



Ik* profile was set up as sitown below: 



A1 WIRE 

A 2 MANUFACTUR$$$ OR CAS$$$$ OR PRODUC$$$$ 

OR COS?$ OR ECONCM$$$$ 

CONI A1 AND A2 

The output for this profile was excessive. The user declared himself 
to be relevance-oriented. 

We discussed the user's profile with him, and as a result made the 
following adjustment: 

1. General terns "manufacture" and "production" were supplemented by 
the more specific "casting," "prerolling," "rolling," "reeling," "winding." 

2. Exhaustivity of this search expression was enhanced by "aluminum" 
to define the type of material in which the user was actually interested. 

We found that only modern continuous methods were sought. 
"Continuous," therefore, further increased the exhaustivity of this search 
expression. 

3. The truncation CAS$$$$ was wrong and was responsible for retrieving 
irrelevant "CASE" and "CASING." It was corrected to CAST$$$. 

4. Some irrelevant information was a result of the marketing require- 
ment. Because of this we excluded "market," "sales," "prices," "trade," 

by means of the NOT logic. 

5. We tied the logic levels formed by using the more exacting and 
restrictive Willi in place of AND. 

The adjusted profile is designed to yield a high relevance: 




A l WIRE 

A 2 ALUMINUM OR ALUMINUM 



( 14 ) 



A3 MANUFACTUR$ $ $ OR PRODUC$$$$ OR CAS1’$$$ OR PREROLL$$$ 
OR ROLL$ $$ OR RliEL$$$ OR IVINL)$$$ OR F.OONOM$$$$ 

A4 COSTS 
A 5 CONTINUOUS 

CONI A1 WITH A2 WITH A3 WITH A5 
OON2 A1 WITH A3 WITH A4 

CON3 NOT MARKETS $$ OR SLL$$$$ OR SAIH$ OR PR1C$$$ 

OR TRADE 



(3) How Co Increase Recall 



1. One veiy comnon reason for finding little response in the data 
base to one's query is lack of multiple approach in the data base. The 
abstractor or indexer is at fault here. It simply meaas that the data- 
base record(s) have few "access points," so that the query, however well- 
framed it may be, caruiot hit the relevant abstracts easily. 

These low-quality abstracts will be discovered only by a thorough 
check of the data base. 

The only thing a search editor can do about this, is to formulate 
the query using ail facets (categories); the search expressions should be 
very simple to increase the chances of any of them hitting into this 
imperfect description of docunent content. In other words, we apply 
higher exhaustivity by using more search expressions (and match criterion 
equal 1, of course). 

Although these tactics are especially suitable for a daca base of 
a low exhaustivity, it will prove helpful any time there is a need to in- 
crease recall. 

At the same time, the existing search expressions should be made 
less complicated, if possible. 

2. Another way to improve recall is to increase synonymity in a 
question. 



Suppose, for example, we are seeking literature dealing with 
"search editors," and we are not content with the recall; we specify 

INFORMATION WITH OFFICERS OR SPECIALISTS OR SCIENTISTS 
along with 

SEARCH AUJ EDITS $$ 

The user should supply all of the synonyms. A user demanding 
information regarding "potash" can ensure a good recall only if he provides 
also the terms "potassium carbonate" and "K-COj." 

3. A low recall may result if we specify a subject by a term in only 
one hierarchical level, whereas if specified in two or more levels hits 
would be very likely. 

Referring to our example in the previous chapter, if our information 
need can be described as "thermoplastic resins," we are more likely to obtain 
a better recall by using the imnediately higher and lower hierarchical levels 
as well: 



RESINS OR POLYVINYLALCOHOL 

Notice that there is no need to specify 'THERM0P1ASTIC WITH RESINS," because 
"RESINS" covers them too. 

Using this type of specification yields lower relevance because of 
the more general term "resins," which will find other types of resins, too. 

4. Recall can be improved by simply using WITH instead of AIAJ , AND 
instead of WITH, or use AND only. 

If two words in the question do not match a pair of words close f 
together, tlioy have a better chance of matching than in a sentence or in 
the wiwle record. This, of course, does not apply to word groupings which 
occur only in one particular order (sequence). 
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5. Great care must be taken in using capitalization. Unless 
capitalization offers considerable benefit in terms of relevance, it is 
better to forget it. 

It is especially important to remember that acronyms may appear 
with only initial capitalization as well as with aJl letters in upper case. 
If in doubt, it is better to use one "at" sign rather than two. 

6. The "CONTROL" feature will very' clearly curb output for any profile 
and decrease recall. Therefore, do not use "CJCLNTROL" if it is desired to 
enhance recall, unless .t is required for some other good reason. 

7. "NOT-CONTROL" should be used judiciously, without preventing 
relevant information from being retrieved. 



8. It is important to ensure that any truncation used does not ex- 
clude any relevant information. Remember that "0RGANIZ$$$" covers 
"organize," "organizing," but not "organization," "organizations," "organi- 
zational." Think of all forms possible. If indoubt, use unconditional 
truncation. 



9. If higher recall is sought, no match criterion greater than l 
should be used. 



10. Words in the profile should be correct as they are supplied by the 
user. Should it happen by some accident, that they arc not, both relevance 
and recall will be poor. Urge the user to replace them by proper terms. 

Misspellings, errors in coding, typing and punching also belong to 
this category, because they produce words that camiot be matched in the 
desired way. 

Check the entire wording of the profile print-out carefully. 

11. \nv "NOT" logic used must not be excessively restrictive, lor 
example, in 

ERJT 



( 17 ) 



COM 14 CARS OR VEHICLES 
CONI 5 NOT MARKLT$$$ 



this "NOT" would be too restrictive if information about prices of cars is 
also required. 

In most cases the "ABS" logic is very helpful, if it is necessary 
to override any limitation. Suppose information is wanted about agricul- 
tural products, but there is enough about barley, rye and oats. If the 
formulation 

CONI AGRICULTUR$$ WITH PR01XJC$$$ 

00N2 NOT BARLEY OR RYE OR OATS 

does not give a good recall for wheat, specify 

CONI AGR1CULTUR$$ AND PRODUC$$$ 

COM2 NOT BARLEY OR RYE OR OATS 

COM3 ABS WHEAT 

In some cases we cannot help getting a lot of irrelevant material 
if we want to ensure that certain specific information is picked up. This 
holds true with general terns such as "profile," "question," etc. 

(4) An Example of Increasing Recall 

Assume w r e have the following request: "I need, for a research project 

I am working on, any information pertaining to the continuous dyeing of 
synthetic fibers at high temperatures without pressure. I am especially, 
though not exclusively, interested in the dyeing of polyester fibers. I 
liave heard that the company SYNDYE has been marketing for this purpose 
some special types of dyestuffs with good dyeing properties and this infor- 
mation would he of high interest." 

The profile was formulated like this: 



MATCH CRITERION * 2 

A1 SYNTHETIC A1U FIBER 

A 2 lOLYllSTLR 

A3 HIGH /MU T1MPLRA11IRE 

A4 KiTlIXff AIU PRESSURE 

AS CONTINUOUS AIU DYEING 

COM A1 AND A 2 AM) A3 AM) A4 AM) AS 

CON 2 SYD.VYE WITH TOLYtSTliR WITH DYEING 



The user did not get any hi t ^ for this profile. As the user is recall- 
oriented, we had to try hard to satisfy him before losing him as a subscriber 
to the service. Kc did the following: 

1. Having due regard for what was explained, we first of all set the 
natch criterion e 1. 

2. Ke then talked to the user and learned that '’FIBER” may be spelled 
also as "ftRRE.” 

3. Ke found that synon\tnous and related to "SYNTHETIC FIBER” are 
"ARTIFICIAL,” ’MltMlCAL," "PIASTIC," "NIK.” "KIT! OUT AIU PRESSURE” could 
also be stated as ”L0h AM FRESSURE," ”NC*MAL AIU PRESSURE.” 

4. Ke decided to use truncation in the terms ’’FIBERS," "POLYESIERS," 
"TIMFERATURE*” and to use DYES$S$<$ to cover DYEING, HIT, DYES, DYESTUFF, 
DYESTUFFS as veil . 

5. Ke discovered one Misspell ir^ which had occurred in coding the 
profile: SYMA’E should be used instead of SYDNYE, 

6. Instead! of using one search expression ("exhaust ivily in one 
expression”) we turned to more and simpler search expressions f’alaustivity 
in nore expressions”) to apply a multiple approach. 

?. Kc replaced ADI by KITH in "SYNTOTIC AID FIBER” and ATU hv AM) 

In *\'*)\TINh'5b» Mil cYJ IV " in view cf Jtc possible word sequence; for 



example, we could meet in the data Iwse "FIBERS MADE OF THE NEW SYNTHETIC 
RVVMUCRIAES," which would not lie matched by using t ho initial formulnt ion. 

8. Tlie user has rounded out hi s request stating tluit not only fibers, 
but also yarns, fabrics and knitted goods were of interest. 

The adjusted profile has the following structure: 



MITCH CRITERION- ■= 1 

At swim-lie or artificial or domical or mw or plastic 

OR flYLYliS’ LH$ WITH FIBRES OR FIBERS OR YARNS 
OR FABRIC* OR HOSIERY OR KNITSSS 
A 2 Ilia I TEMPERATURE 

A3 ivimxrr or low or .votmae aiu prissure 

AT CONTINUOUS 

AS HYF4SSSSS 

OON1 A1 AND A2 AND AS 

OON2 A l AND A3 AND AS 

CONS \1 AND AT AND AS 

A6 SYNDYE 

A 7 Wl.YESTER 

CON4 AS ANIl A6 AND A7 

This formulation offers a very good recall with a fair relevance. 
It allows even an extrene recall to be obtained, if we use combinations of 
only two logical symbols in the search expressions. Relevance, however, 
would be impaired very substantially in this case. 



VI OONCLHS ION 

This manual is intended to fill the gap between our Profiling CtiiJe (2) 
uxl tlie Piogress Report (11. As the logical extension of the former it uses 
tlx* knowledge gaimxl in atoning the latter. 



.Vs ve have seen, ther** are various tools available to the search 



editor for monitoring in the desired way the output, he receives for pro- 
files submitted. 

For the search editor to take full advantage of these tools, he must 
not only know them, but be capable of selecting the proper means in each 
particular case for achieving the desired goals. 

The prerequisite, however, is to ascertain the user's real information 
need, and whether he is recall- or relevance-oriented. The proper ways to 
do this have been shown. 
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